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December 1994 ACM Computing Surveys (CSUR), Volume 26 issue 4 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms , review 

In the last three decades a large number of compiler transformations for optimizing 
programs have been implemented. Most optimizations for uniprocessors reduce the 
number of instructions executed by the program using transformations based on the 
analysis of scalar quantities and data-flow techniques. In contrast, optimizations for high- 
performance superscalar, vector, and parallel processors maximize parallelism and 
memory locality with transformations that rely on tracking the properties o ... 

Keywords: compilation, dependence analysis, locality, multiprocessors, optimization, 
parallelism, superscalar processors, vectorization 


Implementation and performance of Munin 
John B. Carter, John K. Bennett, Willy Zwaenepoel 

September 1991 ACM SIGOPS Operating Systems Review , Proceedings of tlie 

thirteenth ACI^ symposium on Operating systems principles SOSP 

'91, Volume 25 Issue 5 
Publisher: ACM Press 

r- .. * 4 I ui 0 ^*/4^eiviiD^ Additional Information: full citation , abstract , references , citing s, index 

Full text available: TO pdf(1.46MB) ^ ^ 

^^-^^^ terms 

Munin is a distributed shared memory (DSM) system that allows shared memory parallel 
programs to be executed efficiently on distributed memory multiprocessors. Munin is 
unique among existing DSM systems in its use of multiple consistency protocols and in its 
use of release consistency. In Munin, shared program variables are annotated with their 
» expected access pattern, and these annotations are then used by the runtime system to 
choose a consistency protocol best suited to that acc ... 

System area network mapping 

Brent N. Chun, Alan M. Mainwaring, Saul Schleimer, Daniel S. Wilkerson 
June 1997 Proceedings of the ninth annual ACM symposium on Parallel algorithms 
and architectures 
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4 . The design and implementation of hierarchical software systems with reusable 

^ components 
^ Don Batory, Sean O'Malley 
' October 1992 ACM Transactions on Software Engineering and Methodology (TOSEM), 

Volume 1 Issue 4 
Publisher: ACM Press 
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Full text available: TO pdfO.ISMB) ^ 

terms , review 

We present a domain-independent model of hierarchical software system design and 
construction that is based on interchangeable software components and large-scale reuse. 
The model unifies the conceptualizations of two independent projects, Genesis and Avoca, 
that are successful examples of software component/building-block technologies and 
domain modeling. Building-block technologies exploit large-scale reuse, rely on open 
architecture software, and elevate the granularity of programming to ... 

Keywords: domain modeling, open system architectures, reuse, software building-blocks, 
software design 


6 A new approach to developing and implementing eager database replication 
protocols 

Bettina Kemme, Gustavo Alonso 

September 2000 ACM Transactions on Database Systems (TODS), volume 25 issue 3 
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i^^y^"—^ terms , review 

Database replication is traditionally seen as a way to increase the availability and 
performance of distributed databases. Although a large number of protocols providing data 
consistency and fault-tolerance have been proposed, few of these ideas have ever been 
used in commercial products due to their complexity and performance implications. 
Instead, current products allow inconsistencies and often resort to centralized approaches 
which eliminates some of the advantages of replication. As an ... 

Keywords: database replication, fault-tolerance, group communication. Isolation levels, 
one-copy-serializability, replica control, total error multicast 
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There are several replica control algorithms for managing replicated files in the face of 
network partitioning due to site or communication link failures. Pessimistic algorithms 
ensure consistency at the price of reduced availability; they permit at most one 
(distinguished) partition to process updates at any given time. The best known pessimistic 
algorithm, voting, is a ''static" algorithm, meaning that all potential distinguished partitions 
can be listed in adv ... 

7 Linearity and the pi-calculus 1 
Naoki Kobayashi, Benjamin C. Pierce, David N. Turner 

January 1996 Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on 
Principles of programming languages POPL '96 
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Parallel logic simulation of VLSI systems 

Mary L. Bailey, Jack V. Briner, Roger D. Chamberlain 

September 1994 ACM Computing Surveys (CSUR), Volume 26 issue 3 

Publisher: ACM Press 

Full text available- 19 odfO 74 MB) Additional Information: full citation , abstract , references , citings, index 

Fast, efficient logic simulators are an essential tool in modern VLSI system design. Logic 
simulation is used extensively for design verification prior to fabrication, and as VLSI 
systems grow in size, the execution time required by simulation Is becoming more and 
more significant. Faster logic simulators will have an appreciable economic impact, 
speeding time to market while ensuring more thorough system design testing. One 
approach to this problem is to utilize parallel processing, taking ... 

Keywords: circuit structure, parallel architecture, parallelism, partitioning, 
synchronization algorithm, timing granularity 


An adaptive data replication algorithm 

Ouri Wolfson, Sushil Jajodia, Yixiu Huang 

June 1997 ACM Transactions on Database Systems (TODS), Volume 22 issue 2 
Publisher: ACM Press 
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This article addresses the performance of distributed database systems. Specifically, we 
present an algorithm for dynamic replication of an object in distributed systems. The 
algorithm is adaptive in the sence that it changes the replication scheme of the object i.e. 
the set of processors at which the object inreplicated) as changes occur in the read-write 
patern of the object (i.e., the number of reads and writes issued by each processor). The 
algorithm continuously moves the replication ... 

Keywords: computer networks, dynamic data allocation, file allocation, replicated data 


''O Fast detection of communication patterns in distributed executions | 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 
Studies on Collaborative research 

Publisher: IBM Press 

Full text available:^ pdf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the execution 
of the application. The visualization tool we use Is Poet, an event tracer developed at the 
University of Waterloo, However, these diagrams are often very complex and do not 
provide the user with the desired overview of the application. In our experience, such tools 
display repeated occurrences of non-trivial commun ... 

11 Proceedings of the SIGNUM conference on the programming environment for 
^ development of numerical software 

^ March 1979 ACM SIGNUM Newsletter, Volume 14 issue 1 
Publisher: ACM Press 
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^ Robert H. Thomas 

^ June 1979 ACM Transactions on Database Systems (TODS), volume 4 issue 2 
Publisher: ACM Press 

Full text available- 1 ?| pdf(2.32 MB) Additional Information: full citation , abstract , references , citings , index 

A "majority consensus" algorithm which represents a new solution to the update 
synchronization problem for multiple copy databases is presented. The algorithm embodies 
distributed control and can function effectively in the presence of communication and 
database site outages. The correctness of the algorithm is demonstrated and the cost of 
using it is analyzed. Several examples that illustrate aspects of the algorithm operation are 
included in the Appendix. 

Keywords: clock synchronization, computer networks, concurrency control, distributed 
computation, distributed control, distributed databases, multiprocess systems, update 
synchronization 


Are quorums an alternative for data replication? 
Ricardo Jimenez-Peris, M. Patino-Martmez, Gustavo Alonso, Bettina Kemme 
September 2003 ACM Transactions on Database Systems (TODS), volume 28 issue 3 

Publisher: ACM Press 

Full text available: ^ pdf(2.23 MB) Additional Information: full citation , abstract , references , index terms 

Data replication is playing an increasingly important role in the design of parallel 
Information systems. In particular, the widespread use of cluster architectures often 
requires to replicate data for performance and availability reasons. However, maintaining 
the consistency of the different replicas is known to cause severe scalability problems. To 
address this limitation, quorums are often suggested as a way to reduce the overall 
overhead of replication. In this article, we analyze several qu ... 

Keywords: Data replication, availability, distributed transactions., quorums, scalability 
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^ Hasanat M. Dewan, Salvatore J. Stolfo, Mauricio Hernandez, Jae-Jun Hwang 

May 1994 ACM SIGMOD Record , Proceedings of the 1994 ACM SIGMOD international 

conference on Management of data SIGMOD '94, Volume 23 issue 2 
Publisher: ACM Press 
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Full text available: TO pdf(1.37 MB) ^ 

terms 

Expert Databases are environments that support the processing of rule programs against a 
disk resident database. They occupy a position Intermediate between active and deductive 
databases, with respect to the level of abstraction of the underlying rule language. The 
operational semantics of the rule language influences the problem solving strategy, while 
the architecture of the processing environment determines efficiency and scalability. In this 
paper, we present elements of the ... 

Paradigms for process interaction in distributed programs 
A\ Gregory R. Andrews 

^ March 1991 ACM Computing Surveys (CSUR), volume 23 issue i 
Publisher: ACM Press 


Full text available: ^ pdf(3.77 MB) 
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terms , review 


Distributed computations are concurrent programs in which processes communicate by 
message passing. Such programs typically execute on network architectures such as 
networks of workstations or distributed memory parallel machines (I.e., multicomputers 
such as hypercubes). Several paradigms— examples or models— for process interaction in 
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distributed computations are described. Tliese include networks of filters, clients, and 
servers, heartbeat algorithms, probe/echo algorithms, broa ... 

Keywords: clients and servers, distributed and parallel algorithms, distributed 
programming, distributed programming methods, heartbeat algorithms, networks of 
filters, patterns for interprocess communication, probe/echo algorithms, replicated 
servers, token-passing algorithms 


''^ Management of a remote backup copy for disaster recovery 

Richard P. King, Nagul Halim, Hector Garcia-Molina, Christos A. Polyzois 
May 1991 ACM Transactions on Database Systems (TODS), volume 16 issue 2 

Publisher: ACM Press 

Full text available* IS Ddf(2 48 MB) Additional Information: full citation , abstract , references , citings , index 
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A remote backup database system tracks the state of a primary system, taking over 
transaction processing when disaster hits the primary site. The primary and backup sites 
are physically isolated so that failures at one site are unlikely to propogate to the other. 
For correctness, the execution schedule at the backup must be equivalent to that at the 
primary. When the primary and backup sites contain a single processor, it is easy to 
achieve this property. However, this is harder to do when ... 

Keywords: database initialization, hot spare, hot standby, remote backup 


17 GPGPU: general purpose computation on graphics hardware B 
David Luebke, Mark Harris, Jens Kruger, Tim Purcell, Naga Govindaraju, Ian Buck, Cliff 
Woolley, Aaron Lefohn 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes GRAPH 
'04 

Publisher: ACM Press 

Full text available: ^pdf(63.03 MB) Additional Information: full citation , abstract 

The graphics processor (GPU) on today's commodity video cards has evolved into an 
extremely powerful and flexible processor. The latest graphics architectures provide 
tremendous memory bandwidth and computational horsepower, with fully programmable 
vertex and pixel processing units that support vector operations up to full IEEE floating 
point precision. High level languages have emerged for graphics hardware, making this 
computational power accessible. Architecturally, GPUs are highly parallel s ... 

18 Practical byzantine fault tolerance and proactive recovery B 
Miguel Castro, Barbara Liskov 

November 2002 ACM Transactions on Computer Systems (TOCS), volume 20 issue 4 
Publisher: ACM Press 
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Full text available: ™pdf(L63_MB) : ' 
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Our growing reliance on online services accessible on the Internet demands highly 
available systems that provide correct service without interruptions. Software bugs, 
operator mistakes, and malicious attacks are a major cause of service interruptions and 
they can cause arbitrary behavior, that Is, Byzantine faults. This article describes a new 
replication algorithm, BFT, that can be used to build highly available systems that tolerate 
Byzantine faults. BFT can be used in practice to implement re ... 

Keywords: Byzantine fault tolerance, asynchronous systems, proactive recovery, state 
machine replication, state transfer 
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^ Divyakant Agrawal, Manhol Choy, Hong Va Leong, Ambuj K. Singh 

^ August 1994 Proceedings of the thirteenth annual ACM symposium on Principles of 
distributed computing 
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20 Im plennentina sequentially consistent shared objects using broadcast and point-to- 
point comnnunication 

Alan Fekete, M. Frans Kaashoek, Nancy Lynch 
January 1998 Journal of the ACM (JACM), Volume 45 issue i 
Publisher: ACM Press 

Full text available* mi Ddf(257 13KB) Information: full citation , abstract , references , citings , index 
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This paper presents and proves correct a distributed algorlthnn that Implements a 
sequentially consistent collection of shared read/update objects. This algorithm is a 
generalization of one used in the Orca shared object system. The algorithm caches objects 
in the local memory of processors according to application needs; each read operation 
accesses a single copy of the object, while each update accesses all copies. The algorithm 
uses broadcast communication when it sends messages to repllc ... 

Keywords: Orca programming language, distributed shared memory, formal methods, 
input/output automata, ordered multicast, replicated data 
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